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ABSTRACT DNA dependent RNA polymerase core subunits app' 
conserved residues at frequencies most closely matched with 
codons at stage 10.4-11.1 in code evolution. An excess of 
acidic residues lowered this estimate, but not by more than 
2.3 stages, with 1 tryptophan (stage 14 addition) in 529 
conserved residues, app' significantly under-represented 
this amino acid, consistent with a cut-off in its residue 
profile before completion of the genetic code. Residue 
profiles in FEN-1 homologs and DNA topoi somerase 1-5' placed 
their origin at stage 11-13. Prokaryote septation protein, 
Ftsz, arose earlier, between stage 8-11. Proteolipid in an 
ATP-driven proton pump served as a marker for cell membrane 
formation. It indicated this event took place near stage 7. 
Cell division, DNA replication and transcription were 
inferred to have originated in a protocell antecedent of the 
last common ancestor. Late-forming residue profiles 
characterized RNA dependent RNA replicase, DNA polymerase, 
reverse transcriptase and ribonucleotide reductase. This 
suggests some early processes, including RNA replication and 
deoxynucleotide synthesis, once depended on catalysts not 
found in extant residue sequences. Early formation of 
topoi somerase I, and enzymes that synthesize and trim RNA- 
DNA hybrids was viewed as evidence for a mixed duplex, with 
linear RNA and DNA strands, in the transition from an RNA to 
DNA genome. 
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Sequence homology in DNA dependent RNA polymerase and some 
accessory proteins, and the presence of a DNA genome in 
cells from each of the three domains of life suggest that 
DNA replication arose in the predi vergence era."*" As 
ribonucleotides produce deoxyribonucleotides, and RNA primes 
lagging strand synthesis during DNA replication and is 
integrated into translation and other cell processes, DNA 

synthesis evidently originated in a riboorganism, equipped 

2-3 

with ribozymes and an RNA genome. This scenario appears 
at variance, however, with the manifest heterogeneity of 
many replication proteins, including DNA polymerase.^ To 
reconcile this with DNA replication in the last common 
ancestor, it is proposed that replication underwent 
substantial change, following separation of Bacteria from 
Archaea and Eukarya. The prospect of gaining an insight 
into the time of origin of DNA replication, from a 
predivergence source, prompted a search for evidence of this 
transition in the residue profile of proteins formed before 
completion of the genetic code. 

DNA synthesis preceding the 'standard code' could be 
expected to involve an ancestral replicase, deficient in 
late-comers to the code. The possibility of chronicling 
events in the pre-di vergence era by means of this approach 
was recognized by Eck & Dayhoff.^ Before the main stages of 
code evolution were known, they proposed that the simple 
residue profile in conserved segments of ferredoxin 
indicated this small protein had evolved when the code was 
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incomplete. Residues in ancestral ferredoxin were recently 
shown to match the codon pattern at the fifth of 18 stages 

o 

identified in code evolution, in addition, conserved 

residues in the proteolipid subunit of a membrane proton 

pump (H"'"-ATPase) were demonstrated to match codons at stage 

7. This supported the proposition that the first cells, with 

a proteo-phosphol i pi d membrane, formed about midway through 

code evolution. The origin of DNA replication is now 

examined. It will be seen that this process can be traced to 

a protocell antecedent of the last common ancestor. 

In this investigation, proteins whose phylogenetic 

distribution establish that they formed before divergence 

from the last common ancestor are ordered in time, through 

comparing features in their residue profile with those 

expected at identified stages of code evolution. These 

stages were established after recognizing that the code had 

coevolved with amino acid biosynthetic pathways, which 

initially branched from the citrate cycle, with path lengths 

measured from this cycle, distinct types of amino acids were 

8 + 

seen to enter the code at different stages : NH^ fixers 
(Asp, Glu, Asn, Gin), with paths of only 1 or 2 reactions, 
preceded small, increasingly hydrophobic molecules (Ala, 
Val , Pro, Ser, Gly, Cys, Thr, Leu, lie. Met), with paths of 
4 to 7 reactions, and they preceded residues bearing large 
basic or aromatic side chains (Arg, Lys, Phe, Tyr, His, 
Trp) , whose paths extend for 9 to 14 reactions. A set of 
codon patterns resulted that clarified long- recognized 



5 

regularities in the code.^ They also matched the 
distribution of conserved residues in predivergence 
proteins.^ 

Two features of the residue profile were used to determine 
when an archaic sequence formed in the predivergence 
interval. First, the stage (path length) of its most advanced 
amino acid was noted, where a sequence originated before 
completion of the code, its residue profile will exclude 
amino acids added to the code at a later stage. A sizeable 
difference in path length between the most advanced residue 
in a conserved sequence and Trp (stage 14) substantially 
decreases the probability that a sequence formed after 
completion of the genetic code, 

O = (1 - f)"^ , f = I f ■ , < f < 1 

i=n+l ^ 



O corresponds to the fraction of sequences containing N 
residues, where none have a path length exceeding n 
reactions, f^- is the expected frequency of residues with 
path length i. Code evolution effectively advanced by 1 
stage, with addition of 1 reaction to amino acid 
biosynthesis; each stage being estimated to require a 
notional interval of 10^ yr.^ 

A second indicator of protein 'age' relied on goodness of 
fit between residue frequencies and codon pattern at a given 
stage of code evolution. Agreement between them will tend to 
a maximum at the stage of origin, when selection 
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independently targeted a sufficient number of residues. 

Present determinations involved a non-parametric measure, 

the Kolmogorov-Smi rnov test. A single peak in goodness of 

fit, or an s-shaped curve resulted. They portray, 

respectively, sequence formation before and after code 

completion, stage 14 effectively marked completion of the 

code. Subsequent changes were minor and produced 

comparatively small shifts in goodness of fit. with residue 

10 

frequencies in 37 monomeric proteins as reference, 
residues in predi vergence sequences could be demonstrated to 
broadly approach the residue profile of a standard protein, 
as their time of origin advanced in relation to code 
evolution , 

p = 0.0307 + 3.08xlO"^S 

where pis the probability that standard and sample 
distributions respresent the same underlying distribution 
and S signifies stage of origin; regression goodness of fit, 
r2, was 0.14. 

Some peak broadening accompanied advancement of the origin 
toward its upper detectable limit, at stage 14. An expansion 
of the sequence fixation interval would be anticipated, as 
the number of active-site residues increased in proteins of 
growing size and complexity. The tempo of change would also 
slow as variant formation decreased, with improved 
replication fidelity. A departure from 'one tRNA species per 
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amino acid species', in the early code, would further 
obscure goodness of fit, following appearance of 
i soacceptors . Results from analysis of residue profiles were 
consistent with interspecies sequence homology. Residue 
profiles found to have evolved before completion of the 
genetic code occurred in proteins with wide phylogenetic 
di stri bution . 

Figure 1 shows conserved residues in several proteins, 
distributed by their stage of entry to the genetic code. The 
proteins include prebiotic catalysts and DNA specific 

Figure 1 

enzymes arr ranged with an expanding residue range. Residues 
up to stage 5 occur in the protein ancestral to ferredoxin 
from the anaerobe, Clostridium pasteurianum . ^ and to stage 7 
in the proteolipid subunit of H"'"-ATPase (related to V- 

11 

ATPase) from the crenarchaeote , Sul folobus acidocaldarius . 
It is extremely unlikely that a residue sequence with a 
stage 5 cut-off, as in ferredoxin, could form from the 

o 

complete (stage 14) code or a standard set of residue 
frequencies'*"^; 6.55x10"^ ^ P ^ 1.64x10"^, the probability 
limits refer, respectively, to code and protein 
distributions. A stage 7 cut-off in the proteolipid residue 

sequence also departs significantly from the reference 

-2 -2 
frequencies; 3.75x10 < P < 6.50x10 . Other proteins in 

Fig. 1 conserve residues of later vintage, consistent with 

formation after ancestral ferredoxin and proteolipid. 
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Goodness of fit between conserved residue frequencies in 
the sequences examined and codon patterns, at each stage of 
code evolution, are given in Fig. 2. Consistent with the 
limited range of its residue profile, residue frequencies 
for ancestral ferredoxin optimally matched the codon pattern 
at 

Figure 2 

stage 5.4 ± 0.31 (stage of best fit (p„^^) ± standard error; 

max 

S.E. = a//n, where a is the standard deviation (half -width 

of peak at p^^^ x 0.606) and n the number of sample points 
max 

forming the peak) Replacement of Val46 by Glul7, which 
coalign after superposition of segments from an early gene 
duplication,^ gives a slightly more archaic sequence than 

o 

that appraised. Goodness of fit with stage 5 codons is then 
reduced, however, from 0.98 to 0.90. Also consistent with 
its residue range (Fig. 1), proteolipid residue frequencies 
matched most closely codons at stage 6.2 ± 0.45. Exclusion 
of charged and polar residues (stage 2 amino acids) from 
proteolipid (Fig. 1) conforms with strong selection forces 
for non-polar residues in forming this prebiotic membrane 
protein. When comparisons between proteolipid residue 
frequencies and codon patterns were restricted to stage 4- 

o 

10, goodness of fit at stage 7 improved from 0.44 to 0.57 
(Fig. 2a). 

Proteolipid formation at stage 7 places it at a turning- 
point in code evolution. Following this event, the first 
positively charged and aromatic amino acids entered the 
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code. The antiquity of H"'"-ATPase ' and i ntramembrane 

location of proteolipid suggests that evolution of the first 

cells, encapsulated by a proteo-phospholipid membrane, 

altered the course of code evolution. Confinement of early 

molecular processes within a cell membrane would free them 

from the restriction to anionic reactants governing a 

8 14 

primordial surface system. ' Ancestral ferredoxin predated 
proteolipid, by present criteria, and a stage 5 residue 
profile (Fig. 1, 2a) ensured the protein was polyanionic. 
The dependence of precellular metabolism on charge 
attraction is further illustrated by the multianionic 
molecules in central metabolism."*"^""*"^ 

Cell division and replication proteins were found to 
originate after stage 7, as anticipated. Prokaryote 
septation protein, Ftsz, occurs in both Archaea and Bacteria 

"^^ establishing it also occurred in their common ancestor. 
Stage 10-11 residues (Lys, Phe, Tyr) were the most advanced 
amino acids conserved by FtsZ(Fig. 1), although they were 
under-represented (9 of 194 conserved residues vs. 6 of 62 
codons; p (binomial) = 2.36x10"^). Omission of His+Trp, 
furthermore, indicates there was an early cut-off in the 
Ftsz residue profile; 2.68x10"^ ^ P ^ 4.45x10"^. Cell 
division may thus be traced back to at least stage 11 during 
code evolution. Evidence of cell division as early as stage 
8.3 ± 0.43 is furnished by the correlation between FtsZ 
residue frequencies and codon patterns (Fig. 2a). These 
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results fix the origin of the septation protein at stage 8- 

11, possibly soon after appearance of the first cells. 

A FEN-1 homolog, DNase IV, and its homologous domain in 

DNA pol I provided the first evidence of DNA synthesis. This 

enzyme targets the RNA-DNA hybrid molecule formed during 

lagging strand synthesis in DNA replication. A structural 

and functional homology between the 5 ' -exonuclease of 

18 

Bacteria and Eukarya places this enzyme in their common 

ancestor. FEN-1 homologs conserved residues up to stage 11 

(Fig. 1). The probability of this cut-off is, 4.18x10 < p 
_i 

< 1.48x10 . Its conserved residue frequencies were most 
compatible with codons at stage 8.7 ± 1.05 (Fig. 2a). These 
results indicate ancestral FEN-1 originated between stage 9- 
11. 

Selection for acidic residues noticeably skewed the 

residue profile of this nuclease toward an earlier codon 

pattern, structural studies on a phage FEN-1 homolog 

revealed that Asp clusters bind divalent cations (Zn, Mn) at 

the catalytic site, in front of a helical arch. Several Asp 

residues also occur in a motif behind the helical arch, 

19 

where they would facilitate product release. Stage 2 
residues (Asp, Glu, Asn, Gin) form the largest group of 
amino acids conserved by FEN-1 homologs (Fig. 1). They 
account for 1/4 of all conserved residues (12/48), with 
acidic residues representing 3/4 of them. This exceeds the 
frequency of stage 2 codons in a stage 9-11 code, where they 

o 

contribute 1/5 to 1/6 of assigned codons. Stage 2-free 
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goodness of fit determinations between conserved residues 

and assigned codons shifted p„^^ to 11.5 ± 0.64. This 

in ax 

advances the estimated time of origin for ancestral FEN-1 by 
2.8 stages, placing it at the upper limit of the previous 
range and near the cut-off in nuclease residue range. 

Consistent with these results, non-conserved residues in 

18 

theS ' -exonuclease domain of DNA pol I included residues to 
stage 13; 2 stages later than for conserved residues. 
Frequencies among 167 non-conserved residues displayed 
maximum goodness of fit for the codon pattern at stage 11.4 
± 0.75; 2.7 stages later than for conserved residues. Both 
parameters reveal that non-conserved sites in the nuclease 
sequence evolved later than its conserved, active sites. 

Conserved sequences in topoi somerase and core subunits of 
DNA dependent RNA polymerase provide further evidence of 
early DNA replication. Topoi somerase 1-5' from 8 bacteria 
species, including thermophiles Thermotoga mari tima and 
Fe rvi dobacte r i um i slandicum . and the thermophilic 
euryarchaeon , Methanococcus jannaschi i . contained residues 

up to stage 11 (Fig. 1) in motifs 1, 7a, b and Zn binding 

20 -2 
motifs 1-3. This cut-off has a probability, 6.22x10 < p 

< 1.88x10"''". Residue frequencies correlated optimally with 

codons at stage 12.1 ± 0.75 (Fig. 2a). Topoi somerase 1-5' in 

reverse gyrase (carboxy domain) of Sulfolobus shibatae Bl2 

21 

and topA gene product of T. mari tima . by comparison, 
conserved residues to stage 13, in motifs 1-10, with 
frequencies that best fit codons at stage 12.5 ± 0.84. These 
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results fix the probable origin of topoi some rase 1-5' 

between stage 11-13. 

Topoi somerase 1-5' is present in both prokaryote domains, 

21 22 

while topoi somerse II occurs in all 3 domains. Both 

topoi somerases clearly originated before divergence of the 

prokaryote domains. Residue sequences in topoi somerase II 

23 

protomers, Pare and ParE, indicate, however, that they 
formed after the standard code (Fig. 2a). As topoi somerase I 
preceded decatanase, according to these results, genes 
apparently consolidated into linear chromosomes, which later 
became circular in the last common ancestor. 

Each core subunit of DNA dependent RNA polymerase exhibits 
a goodness of fit peak in Fig. 2b. Subunit D (a analog, 
subunit binding domain) of archaeons S. acidocaldarius and 
Haloarcul a mari smortui and its eukaryote homologs from 
Saccharomyces cerevisiae . subunits AC40, B44,^^ display 
optimal agreement with codons at stage 9.6 ± 0.75. His is 
its most advanced residue (Fig. 1). A comparable profile 
arises in a of Escherichia coli and B. subtilis polymerases, 
2 5 

albeit pp' binding domain residues predated those for the 
UP-promoter. Among 314 residues, 144 were conserved 
(identity, 0.46). Its most advanced residue being His (stage 
13). a residue frequencies best fit the codon pattern at 
stage 10.4 ± 0.92. The p' subunit of E^ coli and its 
eukaryote analogue in cerevisiae . rpo 21 gene product, 
conserved residues that matched codons at stage 11.1 ± 0.85 
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(Fig. 2b). A single Trp occurred among 142 conserved 
residues, subunit from 

27 

E. col i . rpoB product, and S. cerevi siae rpb2 product, 
likewise, conserved residues that best fit codons at stage 
11.0 ± 0.78 (Fig. 2b). Conserved residues in p extended to 
His (Fig. 1). 

RNA polymerase subunits app' are seen to conserve residues 
with a stage 10-11 frequency distribution. Moreover, the 
single Trp in app' (1/529 conserved residues) significantly 
under-represented this amino acid; 5.16x10"^ < P < 1.67xl0~^ 
. No Trp occurred among conserved residues (0/165) in 
archaeon RNA polymerase subunit D. These findings show RNA 
polymerase originated in a protocell, with a stage 10-13 
code. 

Sequence homology in RNA polymerase of Archaea, Bacteria 

28 

and Eukarya established early that transcription occurred 

29 

in the last common ancestor. Its polymerase residue 

profile now furnishes a probable time of origin for 

transcription within the predi veregence era. As with FEN 1 

homologs, an excess of stage 2 residues skewed estimates 

based on the polymerase residue profile toward an earlier 

origin. Goodness of fit determinations, with stage 2-free 

distributions, advanced p„^^ for each subunit by up to 2.3 

max 

stages. Broadly, this placed the origin of RNA polymerase at 
stage 11-13. This agrees with the profile cut-off. It also 
supports the finding that transcription evolved when the 
genetic code was still incomplete. 
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These stage 2-free goodness of fit determinations 

indicated that (I' was formed first among RNA polymerase 

30 

subunits (p^^^ = 11.7 ± 0.41). Mutational analysis 

max 

formerly revealed that contains binding sites for both 

DNA template and RNA transcript growth point (3 ' -termi nus) . 

31 

Sequence homologies, furthermore, show a likely kinship 
between p' and reverse transcriptase (simian AIDS virus), x 
subunit of DNA polymerase, and a number of other proteins 

that interact with DNA, or RNA. 

32 

DNA pol I, plus-Strand virus RNA dependent RNA 

33 

replicase and prokaryote reverse transcriptase have S- 
shaped goodness of fit curves (Fig. 2b), consistent with 
formation after the standard code. Profile cut-offs observed 
in these proteins were, respectively, stage 13, 11 and 14; 
His+Trp omission from RNA replicase was not significant, 
0.18 < p < 0.36. A late-forming residue profile, with no 
visible goodness of fit peak (Fig. 2a), also occurred in 
type I and II ribonuclease reductase^^ and, as noted, in 
topoi somerase II protomers Pare and ParE (Fig. 2a). They 
conserved residues to stage 13-14. The glycyl radical domain 
(type III) of an E . col i type II ribonuclease reductase, 
with an adenosylcobalamin cof actor, displayed a late- 
forming frequency distribution. It conserved residues up to 
stage 11; this cut-off had probability, 0.09 < p < 0.24. Its 
oxygen sensitivity, and S-adenosylmethi oni ne and FeS cluster 
cofactors favour type III ribonucleotide reductase being 
ancestral to type I and II reductases, whose allosteric 
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control mechanism it shares. Present results support the 
29 35 

view ' that extant reductases were preceded by an earlier 
catalyst. It putatively catalyzed deoxynucleotide synthesis 
before completion of the genetic code. Late-forming 

sequences also characterized the 3'-5' nuclease domain of 

32 

DNA polymerase - Klenow fragment, clamp loading factor RF- 

C - DNA pol T, RNase Hi, and the ribo-proteozymes 

38 39 
telomerase and RNase P. 

This analysis of residue profiles in archaic proteins has 

provided evidence of a predi vergence mechanism of DNA 

replication, significant gaps remain, however. Some late- 

forming replication proteins almost certainly took over 

functions once performed by other molecules, not found in 

the record preserved by extant residue sequences. RNA 

apparently provided the infrastructure for replication and 

translation in pre-code and pre-cellular systems , ^ ' 

yet RNA replicase formed later than the standard code. Its 

formation was presumably retarded by an early reliance on a 

ribozymal replicase. Deoxynucleotide synthesis by a free 

radical reduction mechanism appears beyond the scope of a 

35 

ribozyme. However, a proto-reductase involving glycyl and 
Cys-thiyl free radicals, with adenosyl and FeS-cluster 
cof actors, could, in principle, have formed as early as 
stage 5 in code evolution. Formation of a nuclease specific 
for RNA-DNA hybrids and a DNA dependent RNA polymerase, by 
stage 11-13, conforms with early occurrence of a mixed 
polymer of DNA and RNA strands. Since reverse transcriptase 
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evidently formed after the standard code (Fig. 2b), RNA 
polymerase specificity may have been modified^"*" to catalyze 
RNA dependent DNA synthesis. Early formation of 
topoi somerase I favours a linear molecule as the hybrid 
intermediate in the transition from an RNA to DNA genome. 
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Figure legends 

Fig. 1. Distribution of residues conserved by archaic 
proteins. The range of residues, with respect to the stage 
of addition to the code, is an indicator of when a protein 
formed in the predi vergence era. Residue profiles are shown 
for proteins catalyzing DNA replication, monomer synthesis 
and transcription, and prokaryote cell division. Proteolipid 
subunit of a membrane proton pump from an acidothermophilic 
archaeon and ferredoxin from an anaerobic eubacterium 
exemplify very early, prebiotic proteins. At top, amino 
acids in groups, whose codons were assigned at the same 
stage of code evolution. Transition between each stage 
effectively corresponded to addition of one reaction step to 
an amino acid biosynthetic pathway; path length being 
measured from the citrate cycle. Each stage of code 
evolution was estimated to represent a notional interval of 
10 yr. Frequency of conserved residues in sequences 
examined appear under Fig. 2. 

Fig. 2. Probability curves peak at the stage of code 
evolution with best fit between assigned codon pattern and 
frequency distribution of conserved residues in each archaic 
protein, (a) Prebiotic, cell division, deoxyribonucleotide 
synthesis and accessory proteins, and (b) enzymes in DNA or 
RNA replication, DNA transcription and reverse 
transcription. DNAP, DNA polymerase (pol l-like) catalytic 
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32 

site of Klenow fragment, n = 138, n = 0.67, where n is 



number of sites compared, and n the fraction of sites with 

18 



identical residues. Fdn, ferredoxin of pasterianum . ^ n = 



25, 71 = 0.48. FEN-1, 5 ' -exonucl ease of human and E . col i . 
n = 215, 71 = 0.22. FtsZ, filamentous temperature sensitive 
mutant protein of 8 bacteria and 1 archaeon species and 1 
chloroplast source, ''"^ n = 390, k = 0.49. Pare and ParE, 
partition proteins C and E of Mycoplasma homini and E. col i . 

n = 717, 71 = 0.30 (Pare), n = 602, 71 = 0.37 (ParE). PL, 

proteolipid subunit of H"'"-ATPase of acidocaldarius . n = 

11, 71 = 0.73. RNAP, DNA dependent RNA polymerase subunits p, 

P' and D: p of E. coli . rpoB product, and yeast, rpb2 
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product, n = 342, 71 = 0.42; p' of E . col i and yeast, rpo21 

product, n = 324, 71 = 0.44; D of 2 archaeon species and 

yeast subunits AC40, B44,^^ n = 265, 71 = 0.62. RNAR, RNA 

dependent RNA replicase of 15 plus strand vi ruses, n = 

168, 71 = 0.16. RNR, ribonucleotide reductase of E . col i 

(class I) and I. acidophi la (class II)^^ n = 366, % = 0.24; 

RT, reverse transcriptase of Myxococcus xanthus (msMxl62 , 

msMx65) and E. coli (msEc67, msEcB86),^^ n = 178, 71 = 0.83. 

Topo I, type 1-5' topoi somerase of S. shibatae Bl2 and T. 
21 

mari tima . n = 230, 7i = 0.50. p, probability that residue 
and codon frequency distributions were from same underlying 
distribution, with goodness of fit determined by a 
Kolmogorov-Smi rnov test. 
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